Interplay between secondary and tertiary structure formation 
in protein folding cooperativity 
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The folding cooperativity of proteins is characterized by the rela- 
tive population of intermediate states at the transition temperature: 
while two-state transitions exhibit two energetic peaks character- 
izing the folded and unfolded ensembles, downhill folders show 
a unimodal distribution of energetic states without any barrieniil 
This aspect of finite-size thermodynamic transitions can provide 
insight into the folding mechanism, but energetic populations can 
be difficult to measure. Therefore, protein folding cooperativity 
is often probed using the calorimetric criterion, ^ which quantifies 
the sharpness of the specific heat curve. From a computer simula- 
tion point of view, however, evaluating the probability density p{E) 
remains an appealing idea, as it would provide an unambiguous 
description of the thermodynamic transition. Although currently 
untractable atomistically due to sampling limitations, high resolu- 
tion coarse-grained models offer an alternative approach.— While 
cutting down significantly on computational time, they can retain 
much chemical detail, and some are even able to fold simple pep- 
tides with no prior knowledge of the native state. In this Commu- 
nication we study the link between thermodynamics and structure 
for helical peptides using such a coarse-grained model, ^ details of 
which can be found in the Supporting Information. 

To characterize the thermodynamics of finite-size systems, it has 
been shown that a microcanonical analysis, based on the entropy 
S{E), is often more informative than a canonical analysis. ^'^ Mi- 
crocanonically, S{E) = kBlnQ.{E) where Q.{E) is the density of 
states. One remarkable feature of such a description is its ability to 
unambiguously distinguish between discontinuous (;. e., two-state) 
and continuous (;. e., downhill) transitions. Indeed, two-state transi- 
tions exhibit a depletion of intermediate energetic states leading to 
local convexity in the entropy. This can be best observed by defin- 
ing the quantity AS{E) = Jif(E) — S{E), where the first term is the 
(double-)tangent to S{E) in the transition region.^"'" The method 
relies on accurate measurements of the density of states, calculated 
here using the Weighted Histogram Analysis Method. ' ' All order 
parameters will be analyzed as a function of energy. 

We first examine a short a-helix of sequence (AAQAA)3. '^ The 
density of states reveals a discontinuous transition with nonzero la- 
tent heat AQ ( |[figure][I][]I[ a) between the folded and unfolded 
ensembles (representative conformations at different energies are 
shown). Phase coexistence is associated with a backbending^ of 
the microcanonical inverse temperature T^^' = dS/dE while the 

corresponding canonical relation T^^{{E)cRa), where (£')can is the 
average energy, is monotonic l |[figure][l][]l[ b). The radius of gyra- 
tion ( |[figure][I][]I[ c) quickly drops inside the coexistence region, 
indicating that most structural rearrangements happen within this 
energy interval. We will assume the hydrogen-bond and side chain 
energies, iihb and E^c, to be suitable proxies of secondary struc- 
ture and tertiary contacts, repectively. It proves instructive to look 
at their energetic rates, dEjib/di? and dEs^/dE l|[figure][l][]l[ d): 
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Figure 1. (AAQAA)3. (a) AS{E); error bars reflect the variance of the data 
points (1 (7 interval), (b) inverse temperature from a canonical rj.^„'{(£)caii) 
and microcanonical T^^' (-E) = dS/dE analysis, where (-E)can is the canoni- 
cal average energy, (c) radius of gyration Rg{E) with the error of the mean, 
(d) rates of H-bond and side chain energies dZJdb/dS, <iEf,^/dE. Vertical 
lines deUmit the transition region; its width corresponds to the microcanon- 
ical latent heat AQ. 



even though dEsc/dE stays virtually flat over the energy range con- 
sidered, the sharp peak in dE^^/dE indicates that most secondary 
structure forms within the coexistence region. 

Elongating the sequence to (AAQAA)i5 leads to a qualitative 
change in the folding mechanism. The ground state again forms 
a single a-helix, but the transition is now continuous: as can be 
seen in | [figure] [2] []2| (a) and Figure S2 (Supporting Information), 
there is a single transition point and the latent heat is zero. The ra- 
dius of gyration l |[figure][2][]2[ b) features a sharp minimum above 
the transition point, indicative of a chain collapse into "maximally 
compact non-native states." Upon lowering the energy further, 
the chain will reorganize from such non-native states into the heli- 
cal state. In doing so, the rate of tertiary contact formation dE^c/dE 
dips below zero l |[figure][2][]2[ c), hence there is an energetic 
penalty associated with tertiary rearrangements. Hydrogen-bond 
formation occurs over a large energetic interval as indicated by the 
broad maximum in dE^^^,/dE. The absence of any two-state signal 
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Figure!. (KkQKK)^^. (a) SS(E'). (b) radius of gyration «„(£). (c) rates of 
H-bond and side chain energies dfihb/dZ?, dSsc/dS. Horizontal arrows in- 
dicate where most secondary structure forms and where non-native tertiary 
contacts dissolve. The vertical line marks the transition point. 



is consistent with theoretical models of the helix-coil transition:— 
the energetic cost of breaking a hydrogen-bond is outweighed by 
the conformational entropy gained. Further analysis indicates on 
average two helices at the transition point. 

While of similar length, the 73 amino acid de novo three-helix 
bundle a3D (PDB code: 2A3D) '2, does show a discontinuous tran- 
sition, see I [figure] [3] [] 3 1 (a) . Representative conformations sam- 
pled in the two coexisting ensembles stand as good proxies of the 
ground state and unfolded state, unlike for the downhill folding 
transition of (AAQAA)i5. The radius of gyration again shows a 
minimum above the transition l |[figure][3][]3[ b), and folding once 
more starts from maximally compact non-native states. Notice that 
secondary structure formation and the loss of non-native tertiary 
contacts l |[figure][3][]3[ c) are sharp and predominantly localized 
within the coexistence region. The three helices form inside the 
same energetic interval due to the inter-helical cooperativity im- 
printed in the sequence— Chain compaction is due to strong side 
chain-side chain interactions. 

Overall, we can correlate thermodynamic features with struc- 
tural information from the three peptides studied here. While 
(AAQAA)3 is too short for tertiary interactions to play any role, 
the transitions associated with (AAQAA)i5 and the bundle a3D are 
both associated with tertiary rearrangements. These two examples 
suggest that independently of its nature, the folding transition is 
driven by the loss of non-native tertiary contacts (i. e., the region 
where d^sc/dii < 0) — reminiscent of the heteropolymer collapse 
model. On the other hand, secondary structure formation has 
shown very different signals: (AAQAA)3 and 0!3D exhibit sharp 
peaks where (AAQAA)ig displays a broad maximum. As shown in 
|[figure][2][]2| (c), secondary structure formation in a downhill fold- 
ing peptide occurs over a much broader interval compared to the 
loss of non-native tertiary contacts, whereas these two quantities 
are contained within the same narrow interval for a two-state pep- 
tide l |[figure][3][]3[ c). Cooperative secondary and tertiary structure 
formation has been proposed as a mechanism for two-state folding 
from lattice simulations'^ and theoretical models.'- Beyond this, 
our results also highlight the interplay between secondary structure 
formation and the loss of non-native tertiary contacts. Our conclu- 
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Figure 3. Three-helix bundle a3D. (a) AS{E). (b) radius of gyration 
Rg{E). (c) rates of H-bond and side chain energies dEi^f,/dE, diJsc/dii. 



sions on the thermodynamics of the short-, long-, and bundled helix 
are compatible with the calorimetric criterion, which gives, respec- 
tively, d = 0.78, 0.52, 0.78 for the calorimetric ratio.l Note that 
Ghosh and Dilli^ predicted 5 = 0.72 for the similar bundle a3C. 
However, the main strength of a microcanonical analysis stems 
from an access to fine aspects of thermodynamic information that 
are otherwise difficult to obtain either canonically or from experi- 
ments. It thus stands as a complementary tool to gain further in- 
sight. 
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Abstract 



Protein folding cooperativity is defined by the nature of tfie finite-size tliermodynamic transition exhibited upon folding: two-state transitions 
show a free energy barrier between the folded and unfolded ensembles, while downhill folding is barrierless. A microcanonical analysis, 
where the energy is the natural variable, has shown better suited to unambiguously characterize the nature of the transition compared to its 
canonical counterpart. Replica exchange molecular dynamics simulations of a high resolution coarse-grained model allow for the accurate 
evaluation of the density of states, in order to extract precise thermodynamic information, and measure its impact on structural features. The 
method is applied to three helical peptides: a short helix shows sharp features of a two-state folder, while a longer helix and a three-helix 
bundle exhibit downhill and two-state transitions, respectively. Extending the results of lattice simulations and theoretical models, we find 
that it is the interplay between secondary structure and the loss of non-native tertiary contacts which determines the nature of the transition. 



